Accurate Mass Determinations in Decay Chains with Missing Energy 
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Many beyond the Standard Model theories include a stable dark matter candidate that yields 
missing / invisible energy in collider detectors. If observed at the Large Hadron Collider, we must 
determine if its mass and other properties (and those of its partners) predict the correct dark matter 
relic density. We give a new procedure for determining its mass with small error. 
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One of the most dramatic possibilities for the Large 
Hadron Collider (LHC) is observation of events with large 
missing energy compatible with the production of a sta- 
ble, weakly-interacting particle that could explain the 
universe's relic dark matter content. Many beyond the 
Standard Model (SM) theories contain such a particle, 
denoted N . In particular, in the Minimal Supcrsym- 
metric Standard Model (MSSM) the lightest neutralino 
Xi is stable if i?-parity is conserved. Each LHC event 
must contain two N's that each emerge at the end of a 
chain decay. For example, in the MSSM, a large pro- 
duction rate is associated with squark pair, qq, produc- 
tion, and each q can have substantial probability to de- 
cay via q —> qx2 ~* Q^Xi = e^Mi 7 "); where 
X2 and I are the 2nd lightest neutralino and slepton, 
respectively. More generally, we will use the notation 
Z -> 7 + Y -> 7 + 5 + X -» 7 + 5 + 3 + l(= N), where par- 
ticles 7, 5 and 3 are Standard Model jets or leptons and 
Z, Y, and X are the intermediate on-shell resonances of 
the model in question. This event structure is illustrated 
in Fig. Q] This letter gives a procedure for accurately 
determining Mz, My, Mx and M/v for this topology. 
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FIG. 1: The event topology. 

Many mass determination procedures in the literature 
examine only one decay chain at a time 0, 0, H, 3 . This 
often does not allow one to solve for the event's miss- 
ing momenta. An exception is a very long decay chain 
starting from the gluino, as discussed in Ref. Q. How- 
ever, in the actual analysis the X2, ^ an d X? masses were 
assumed to be known and only the gluino and sbottom 
masses were fitted Q. Considering both decay chains 
simultaneously can potentially give us more information 
and allow a better determination of the masses [1, 0, S| • 
Our current procedure does this for the decay chains of 



Fig. [TJ If all particles can be correctly located on the de- 
cay chains and there are no experimental effects, then by 
considering two events we can solve for all the 4- momenta 
in both events and determine all the masses up to a dis- 
crete ambiguity. After examining a small number of event 
pairings, a unique solution will emerge. 

Assuming we can isolate LHC events with the topology 
in Fig. [T] and using mx = mx>, mx = mx', m Y = my, 
mz = niz>, we have the following constraints, 
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where pi is the 4- momentum for particle i (i — 1 ... 8). 
Since the only invisible particles are 1 and 2 and since 
we can measure the missing transverse energy, there are 
two more constraints: 



Pl+Pl=P X missi P V l+P2=Pr: 



(2) 



Given the 6 constraints in Eqs. ([TJ) and © and 8 un- 
knowns from the 4-momenta of the missing particles, 
there remain two unknowns per event. The system is 
under-constrained and cannot be solved. This situation 
changes if we use a second event with the same decay 
chains, under the assumption that the invariant masses 
are the same in the two events. Denoting the 4-momcnta 
in the second event as ^ (i = 1 ... 8), we have 8 more 
unknowns, q\ and 92, but 10 more equations, 
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(q2 + qi) 2 = (P2 +Pi) 2 , 
(qi + qi + qe) 2 = O2 + Pi + Pe) 2 , 
<?3 + 95 + q?) 2 = fe + qi + q& + q») 2 

= (P2 +Pi +P6 +P&) 2 , 



qi + q2= Qn 
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Altogether, we have 16 unknowns and 16 equations. The 
system can be solved numerically and we obtain discrete 
solutions for pi, p 2 , qi, q-i and thus the masses mjy, rnx, 
my, and mz- Note that the equations always have 8 
complex solutions, but we will keep only the real and pos- 
itive ones which we henceforth call "solutions" . Further 
details regarding practical and high-speed techniques for 
obtaining the solutions will appear in a future paper . 
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FIG. 2: We plot the number of mass solutions (in 1 GeV bins 
— the same binning is used for the other plots) vs. mass in 
the ideal case. All possible pairs for 100 events are included. 

For illustration and easy comparison to the litera- 
ture, we apply our method for the SUSY point, SPSla 
[llj], although many of the discussions below apply for 
generic cases. For SPSla^ the particles correspond- 
ing to N,X,Y,Z are jg, = e/n), xi 9l(<Z = 
d, u, s, c) respectively. The masses are {97.4, 142.5, 180.3, 
564.8/570.8} GeV, with the final two numbers corre- 
sponding to up/down type squarks respectively. Since 
?7!r ^ Tie^, the i = t case is an important background. 
We generate events with PYTHIA 6.4 0. 

We first consider the ideal case: no background events, 
all visible momenta measured exactly, all intermediate 
particles on-shcll and each visible particle associated with 
the correct decay chain and position in the decay chain. 
We also restrict the squarks to be up-type only. In this 
case, we can solve for the masses exactly by pairing any 
two events. The only complication comes from there be- 
ing 8 complex solutions for the system of equations, of 
which more than one can be real and positive. Of course, 
the wrong solutions arc different from pair to pair, but 
the correct solution is common. The mass distributions 
for the ideal case with 100 events are shown in Fig. [3] As 
expected, we observe (5-function-likc mass peaks on top 
of small backgrounds coming from wrong solutions. On 
average, there arc about 2 solutions per pair of events. 

The (^-functions in the mass distributions arise only 
when exactly correct momenta are input into the equa- 
tions we solve. To be experimentally realistic, we now 
include the following. 

1. Wrong combinations. For a given event a "com- 
bination" is a particular assignment of the jets and lep- 
tons to the external legs of Fig[TJ For each event, there 
is only one correct combination (excluding 1357 <-> 2468 
symmetry). Assuming that we can identify the two jets 
that correspond to the two quarks, we have 8 (16) pos- 
sible combinations for the 2//2e (4/j, or 4e) channel. The 
total number of combinations for a pair of events is the 
product of the two, i.e. 64, 128 or 256. Adding the wrong 
combination pairings for the ideal case yields the mass 
distributions of Fig. [3] Compared to Fig. [51 there are 
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FIG. 3: Number of mass solutions versus mass after including 
all combination pairings for 100 events. 

16 times more (wrong) solutions, but the (5-function-likc 
mass peaks remain evident. 

2. Finite widths. For SPSla, the widths of the 
intermediate particles arc roughly 5 GeV, 20 MeV and 
200 MeV for q^, xH an d Thus, the widths are quite 
small in comparison to the corresponding masses. 

3. Mass splitting between flavors. The masses for 
up and down type squarks have a small difference of 6 
GeV. Since it is impossible to determine flavors for the 
light jets, the mass determined should be viewed as the 
average value of the two squarks (weighted by the parton 
distribution functions). 

4. Initial/final state radiation. These two types of 
radiation not only smear the visible particles' momenta, 
but also provide a source for extra jets in the events. We 
will apply a px cut to get rid of soft jets. 

5. Extra hard particles in the signal events. 
In SPSla, many of the squarks come from gluino de- 
cay (<? ~~ * QQl)i which yields another hard q in the event. 
Fortunately, for SPSla rrtg — rriq L — 40 GeV is much 
smaller than m,q L — m^o = 380 GeV. Therefore, the q 
from squark decay is usually much more energetic than 
the q from g decay. We select the two jets with highest pr 
in each event after cuts. Experimentally one would want 
to justify this choice by examining the jet multiplicity 
to ensure that this analysis is dominated by 2-jet events, 
and not 3 or 4 jet events. Furthermore, the softer jets will 
be an indication of clearly separable mass-differences. 

6. Background events. The SM backgrounds are 
negligible for this signal in SPSla. There are a few sig- 
nificant backgrounds from other SUSY processes: 

(a) <tl — ► qx® ~* ~ * <Z TT Xi f° r one or both de- 
cay chains, with all r's decaying leptonically. Indeed, 
X2 — * tt has the largest partial width, being 14 times 
that of x_2 ~* However, to be included in our selec- 
tion the two r's in one decay chain must both decay to 
leptons with the same flavor, which reduces the ratio. A 
cut on lepton pr also helps to reduce this background, 
since leptons from r decays are softer. Experimentally 
one should perform a separate search for hadronically de- 
caying tau's or non-identical-flavor lepton decay chains to 



3 




mass (GeV) 

FIG. 4: Mass solutions with all effects 1-7 included and after 
cuts I - III for the SPSla SUSY model and L = 300 ftr 1 . 

explicitly measure this background. 

(b) Processes containing a pair of sbottoms, especially 
b\. In SPSla the first two generations of squarks are 
nearly degenerate. In any model, they must be discov- 
ered in a combined analysis since light quark jets are 
not distinguishable. Well-separated squark masses would 
show up as a double peak structure in Mz- However b 
jets arc distinguishable and a separate analysis should 
be performed to determine the b squark masses. This 
presents a background to the light squark search since 
^-tagging efficiency is only about 50% at high px- 

(c) Processes that contain a pair of xlz's, n ot both com- 
ing from squark decays. For these events to fake signal 
events, extra jets need to come from initial and/or final 
state radiation or other particle decays. For example, di- 
rect %2 P a i r production or x° + 9 production. These are 
electroweak processes, but, since X2 nas a much smaller 
mass than squarks, the cross-section is not negligible. In 
our SPSla analysis, the large jet px cut reduces this kind 
of background due to the small rrig — m,q L . 

7. Experimental resolutions. In order to estimate 
this experimental effect at the LHC, we process all events 
with ATLFAST0, a fast simulation package of the AT- 
LAS detector. Since we assume 300 fb^ 1 integrated lu- 
minosity, we run ATLFAST in the high luminosity mode. 

The cuts used to isolate the signal are: 

I) 4 isolated leptons with pr > 10 GeV, \r/\ < 2.5 and 
matching flavors and charges consistent with our assumed 
X° -> t-> Xi decay; 

II) No 6-jets and > 2 jets withp T > 100 GeV, \r)\ < 2.5. 
The 2 highest-pT jets are taken to be particles 7 and 8; 

III) Missing p T > 50 GeV. 

For a data sample with 300 fb^ 1 integrated luminosity, 
there are about 1050 events left after the above cuts, out 
of which about 700 are signal events. After taking all 
possible pairs for all possible combinations and solving 
for the masses, we obtain the mass distributions in Fig. [4] 
Fitting each distribution using a sum of a Gaussian 
plus a (single) quadratic polynomial and taking the maxi- 
mum positions of the fitted peaks as the estimated masses 
yields {77.8, 135.6, 182.7, 562.0} GeV. Averaging over 



10 different data samples, we find 

m N = 76.7 ±1.4 GeV, m x = 135.4 ± 1.5 GeV, 
my = 182.2 ± 1.8 GeV, m z = 564.4 ± 2.5 GeV. 

The statistical uncertainties are very small, but there ex- 
ist biases, especially for the two light masses. In practice, 
we can always correct the biases by comparing real data 
with Monte Carlo. Nevertheless, we would like to reduce 
the biases as much as possible using data only. In some 
cases, the biases can be very large and it is essential to 
reduce them before comparing with Monte Carlo. 

The combinatorial background is an especially impor- 
tant source of bias since it yields peaked mass distribu- 
tions that are not symmetrically distributed around the 
true masses, as can be seen from Fig. |3l This will intro- 
duce biases that survive even after smearing. Therefore, 
we concentrate on reducing wrong solutions. 

First, we reduce the number of wrong combinations by 
the following procedure. For each combination choice, c, 
for a given event, i (i = l,N evt ), we count the number, 
N pa i r (c, i), of events that can pair with it (for some com- 
bination choice for the 2nd events) and give us solutions. 
We repeat this for every combination choice for every 
event. Neglecting effects 2.- 7., N pair (c,i) — N evt — 1 
if c is the correct combination for event i. After includ- 
ing backgrounds and smearing, N pa i r {c,i) < N evt — 1, 
but the correct combinations still have statistically larger 
N pa ir(c,i) than the wrong combinations. Therefore, 
we cut on N pa i r (c,i). For the SPSla model point, 
if N pa i r (c,i) < 0.75 N ev t we discard the combination 
choice, c, for event i. If all possible c choices for event 
i fail this criterion, then we discard event i altogether 
(implying a smaller N evt for the next analysis cycle). We 
then repeat the above procedure for the remaining events 
until no combinations can be removed. After this, for the 
example data sample, the number of events is reduced 
from 1050 (697 signal + 353 background) to 734 (539 
signal + 195 background), and the average number of 
combinations per event changes from 11 to 4. 

Second, we increase the significance of the true solu- 
tion by weighting events by 1 /n where n is the number of 
solutions for the corresponding pair (using only the com- 
bination choices that have survived the previous cuts). 
This causes each pair (and therefore each event) to have 
equal weight in our histograms. Without this weighting, 
a pair with multiple solutions has more weight than a pair 
with a single solution, even though at most one solution 
would be correct for each pair. 

Finally, we exploit the fact that wrong solutions and 
backgrounds are much less likely to yield M x , Mx, My, 
and Mz values that are all simultaneously close to their 
true values. We plot the 1/n- weighted number of solu- 
tions as a function of the three mass differences (Fig. [5]). 
We define mass difference windows by 0.6 x peak height 
and keep only those solutions for which all three mass 
differences fall within the mass difference windows. The 
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FIG. 5: SPSla, L = 300 fb" 1 mass difference distributions. 
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FIG. 6: Final mass distributions after the bias-reduction pro- 
cedure for the SPSla SUSY model and L = 300 fb" 1 . 

surviving solutions arc plotted (without the 1/n weight- 
ing) in Fig. [6] Compared with Fig. [4] the mass peaks are 
narrower, more symmetric and the fitted values are less 
biased. The fitted masses are {91.7,135.9, 175.7 558.0} 
GeV. Repeating the procedure for 10 data sets, we find 



m N = 94.1 ± 2.8 GeV, 
my = 179.0 ± 3.0 GeV, 



m x = 138.8 ±2.8 GeV, 
m z = 561.5 ±4.1 GeV. 



Thus, the biases are reduced at the cost of (slightly) in- 
creased statistical errors. 

We have applied our method to other mass points to 
show its reliability. Details will be presented in [9j. We 
quote here results for "point 1" defined in Ref. [6j with 
the following masses: {85.3, 128.4, 246.6, 431.1/438.6} 
GeV. For 100 fb" 1 data, we have about 1220 events (1160 
signal events) after the pre-bias-reduction cuts. After 
following a bias reduction procedure and using 10 data 
samples, we obtain mji = 85±4 GeV, mx = 131±4 GeV, 
ray = 251 ± 4 GeV, m z = 444 ± 5 GeV. 

We emphasize that the remaining biases in the above 
mass determinations can be removed by finding those in- 
put masses that yield the observed output masses after 
processing Monte Carlo generated data through our pro- 
cedures. In this way, very accurate central mass values 
are obtained with the indicated statistical errors. 

The above results for the N, Y and X masses for the 
SPSla point and point #1 can be compared to those 
obtained following a very different procedure in Ref. 
There, only the X — > Y — > N parts of the two decay 



chains were employed and we used only 4/z events. For 
the SPSla model point we obtained mjv = 98 ± 9 GeV, 
m Y = 187 ± 10 GeV, and m x = 151 ± 10 GeV. And, 
for point #1 wc found mx = 86.2 ± 4.3 GeV, m x = 
130.4±4.3 GeV and m Y = 252.2±4.3 GeV. Including the 
4e and 2/j,2e channels will reduce these errors by a factor 
of ~ 2. The procedure of @ can thus be used to verify the 
results for n%N, mx and my from the present procedure 
and possibly the two can be combined to obtain smaller 
errors than from either one, with mz determined by the 
procedure of this letter. 

Overall, we have obtained a highly-encouraging level 
of accuracy for the mass determinations in events with 
two chains terminating in an invisible particle. Once the 
masses are known with this level of accuracy, the next 
step will be to examine detailed distributions for various 
possible models (MSSM, little-Higgs, Universal Extra Di- 
mensions), assuming the determined masses and keeping 
only solutions for each event consistent with them. The 
different models can be expected to predict sufficiently 
distinct distributions (for the same mass choices) that 
the precise nature of the invisible particle can be de- 
termined. We will then be able to make fairly precise 
predictions for its relic density and check for consistency 
with observation. Showing that the dark matter particle 
as observed at the LHC predicts a relic density consistent 
with cosmological observations would resolve one of the 
most important issues of modern-day physics. 

This work was supported in part by U.S. Department 
of Energy grant No. DE-FG03-91ER40674. 
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